Perfect Failure Detection in Timed Asynchronous Systems
Abstract
Perfect failure detectors can correctly decide whether a computer has crashed. However, it is impossible to implement a perfect failure detector in purely asynchronous systems. We show how to enforce perfect failure detection in timed asynchronous systems with hardware watchdogs. The two main system model assumptions are (1) each computer can measure time intervals with a known maximum error, and (2) each computer has a watchdog that crashes the computer unless the watchdog is periodically updated. We have implemented a system that satisfies both assumptions using a combination of off-the-shelf software and hardware. We also show that, in some systems, a hardware watchdog is not actually necessary to implement a perfect failure detector for process crash failures.
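The two assumptions in the abstract suggest a lease-style argument, sketched below. This is an illustrative reading, not the protocol from the paper: the names `Params`, `monitor_wait`, and `may_declare_crashed` are hypothetical, and the drift model (a local clock measures a real interval t as something between t(1 - rho) and t(1 + rho)) is an assumption. The sketch also assumes that the monitored computer only updates its watchdog after the monitor acknowledges a request, and that the watchdog deadline is counted from the moment the request was sent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Params:
    """Hypothetical parameters for the sketch (not taken from the paper)."""
    watchdog_timeout: float  # D: watchdog period, in local clock units
    max_drift: float         # rho: assumed maximum clock drift rate, 0 <= rho < 1


def monitor_wait(params: Params) -> float:
    """Local time the monitor waits before it may declare a crash.

    A watchdog set to D local units fires after at most D / (1 - rho) of
    real time. The monitor's own clock may run fast by a factor (1 + rho),
    so it must count D * (1 + rho) / (1 - rho) local units to be sure that
    much real time has passed.
    """
    d, rho = params.watchdog_timeout, params.max_drift
    if not 0 <= rho < 1:
        raise ValueError("max_drift must be in [0, 1)")
    return d * (1 + rho) / (1 - rho)


def may_declare_crashed(last_acked_request_at: float, now: float,
                        params: Params) -> bool:
    """True once the monitored computer's watchdog must have fired.

    last_acked_request_at is the monitor's local receive time of the last
    request it acknowledged. The request was sent no later than that, so the
    watchdog deadline derived from the send time has passed by then.
    """
    return now - last_acked_request_at >= monitor_wait(params)


if __name__ == "__main__":
    p = Params(watchdog_timeout=10.0, max_drift=0.01)
    print(may_declare_crashed(100.0, 110.0, p))  # too early to decide
    print(may_declare_crashed(100.0, 110.3, p))  # watchdog must have fired
```

The point of the design is that the monitor never guesses: once it stops acknowledging requests, it waits until the watchdog is certain to have crashed the other computer and only then reports the failure, so a reported crash is always a real one under the stated assumptions.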
Related resources
Enforcing Perfect Failure Detection
Perfect failure detectors can correctly decide whether a computer is crashed. However, it is impossible to implement a perfect failure detector in purely asynchronous systems. We show how to enforce perfect failure detection in timed distributed systems with hardware watchdogs. The two main system model assumptions are (1) each computer can measure time intervals with a known maximum error, a...
A Comparison of Timed Asynchronous Systems and Asynchronous Systems with Failure Detectors
Failure detectors were introduced as a concept to provide processes with information about failures. We show in this paper that failure detectors can also provide information about time. In particular, we demonstrate that one can define a failure detector that provides a process with the same properties as a hardware clock. This allows us to demonstrate that all properties – except process reco...
(Im)Possibilities of Predicate Detection in Crash-Affected Systems
In an asynchronous system, where processes can crash, perfect predicate detection for general predicates is difficult to achieve. A general predicate thereby is of the form , where and refer to a normal process variable and to the operational state of that process, respectively. Indeed, the accuracy of predicate detection largely depends on the quality of failure detection. In this paper, we in...
Distributed Predicate Detection in a Faulty Environment
There has been very little research in distributed predicate detection for faulty, asynchronous environments. In this paper we define a class of predicates called set decreasing predicates which can be detected in such an environment. We introduce a set of failure detectors called infinitely often accurate detectors which are implementable in asynchronous systems. Based on these failure detectors...
Failure Detection in Asynchronous Distributed Systems
Being able to detect failures is an important issue in designing fault-tolerant distributed systems. However, the actual behaviour of a system limits the ability to provide such a mechanism. From one extreme of the spectrum, synchronous systems (i.e., with bounded message transmission delay and processing times) allow for the construction of perfect failure detection based simply on local timeo...
Journal title:
- IEEE Trans. Computers
Volume: 52
Publication date: 2003